Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection
نویسندگان
چکیده
Recent advances on 3D object detection heavily rely how the data are represented, i.e., voxel-based or point-based representation. Many existing high performance detectors because this structure can better retain precise point positions. Nevertheless, point-level features lead to computation overheads due unordered storage. In contrast, is suited for feature extraction but often yields lower accuracy input divided into grids. paper, we take a slightly different viewpoint --- find that positioning of raw points not essential and coarse voxel granularity also offer sufficient accuracy. Bearing view in mind, devise simple effective framework, named Voxel R-CNN. By taking full advantage two-stage approach, our method achieves comparable with state-of-the-art models, at fraction cost. R-CNN consists backbone network, 2D bird-eye-view (BEV) Region Proposal Network, detect head. A RoI pooling devised extract directly from further refinement. Extensive experiments conducted widely used KITTI Dataset more recent Waymo Open Dataset. Our results show compared methods, delivers higher while maintaining real-time frame processing rate, speed 25 FPS an NVIDIA RTX 2080 Ti GPU. The code available https://github.com/djiajunustc/Voxel-R-CNN.
منابع مشابه
Voxel Based Object Simplification
We present a simple, robust, and practical method for object simplification for applications where gradual elimination of high frequency details is desired. This is accomplished by sampling and low-pass filtering the object into multi-resolution volume buffers and applying the marching cubes algorithm to generate a multi-resolution triangle-mesh hierarchy. Our method simplifies the genus of obj...
متن کاملME R-CNN: Multi-Expert R-CNN for Object Detection
Recent CNN-based object detection methods have drastically improved their performances but still use a single classifier as opposed to ”multiple experts” in categorizing objects. The main motivation of introducing multi-experts is twofold: i) to allow different experts to specialize in different fundamental object shape priors and ii) to better capture the appearance variations caused by differ...
متن کاملTowards Voxel-Based Algorithms for Building Performance Simulation
This paper explores the design, coupling, and application of algorithms relying on grid-based or voxel-based representations of buildings. Such algorithms may encourage the adoption of building performance simulation in design practice by avoiding the timeconsuming and somewhat arbitrary process of manually partitioning a building’s interior into polyhedral spaces. They also support detailed vi...
متن کاملCascade R-CNN: Delving into High Quality Object Detection
In object detection, an intersection over union (IoU) threshold is required to define positives and negatives. An object detector, trained with low IoU threshold, e.g. 0.5, usually produces noisy detections. However, detection performance tends to degrade with increasing the IoU thresholds. Two main factors are responsible for this: 1) overfitting during training, due to exponentially vanishing...
متن کاملFuzzy Voxel Object
In this paper, computer vision and fuzzy set theory are merged for the robust construction of three-dimensional objects using a small number of cameras and minimal a priori knowledge about the objects. This work extends our previously defined crisp model, which has been successfully used for recognizing and linguistically summarizing human activity. The objects true features more closely resemb...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2021
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v35i2.16207